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METHOD OF SCREENING FOR IMPROVED SPECIFIC ACTIVITY OF ENZYMES 



Field of Invention 



5 This Invention relates to a method for screening libraries of enzyme variants for 

changes in specific activity by expression of a fusion protein consisting of at least two 
enzymes. By using one enzyme as a marker changes in specific activity for the other enzyme 
can be screened efficiently. 



1 0 Background of the Invention 



Many methods of saeening for improved characteristics of proteins, e.g. enzymes, 
have been reported. One property of enzymes which it Is desirable to improve Is the specific 
activity. Technologies such as DNA-shuffling and random or site-directed mutagenesis have 
15 allowed the production of large numbers of variants in a short time. It is therefore desirable to a 
method that allows for fast, eventually high through-put, screening of enzymes with modified 
specific activity. 



Summary of the invention 



20 



The problem to be solved by the present invention is to provide a method for perform 
screening for altered specific activity of an enzyme. 

The problem arises because it is difficult to define the enzyme protein amount - and 
consequently to determine activity per milligram of enzyme protein - In the host cell culture 
25 supernatant without a purification process. To overcome this problem a method has been 
developed comprising the steps of 

(i) generating a library of nucleic acid sequences encoding enzyme variants of 
Interest 

(ii) providing a n ucleic a dd s equence e needing a n e nzyme t o b e fused w Ith t he 
30 enzyme in (1) 

(Hi) fusing nucleic acid sequence encoding enzyme variants in (I) with nucleic acid 
sequence encoding enzyme in (Ii) 

(iv) transfomiing the fused nucleic add sequence obtained In (ill) into a host cell 

(v) culturing host cell in (iv) in order to express the fused en^mes 
35 (vi) sampling each cell culture obtained In (v) 

(vil) analyzing samples obtained in (vi) by determining activity ratio of the expressed 
fused enzymes 

(vlii) selecting the samples exhibiting the desired activity ratio. 
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Definitions 

Prior to a discussion of the detailed embodiments of the invention, a definition of 
5 speciflc temns related to the main aspects of the invention is provided. 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DMA techniques within the sidll of the art. 
Such techniques are explained fully In the literature. See, e.g., Sambrook, Fritsch & Manlatls. 

10 Molecular Cloning: A Laboratory ly/lanuai. Second Edition (1989) Cold Spring Harbor 
Laboratory Press. Cold Spring Harbor, New York (herein "Sambrook et ai., 1989") DNA 
Cloning: A Practical Approach, Volumes I and II /D.N. Glover ed. 1985); Oligonucleotide 
Synthesis {MJ. Gait ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. HIggins eds 
(1985)); Transcription And Translation (B.D. Hames & S.J. HIggins, eds. (1984)); Animal Cell 

15 Culture (R.I. Freshney. ed. (1986)); Immobilized Cells And Enzymes (IRL Press, (1986)); B. 
Perbal. A Practical Guide To IVIolecutar Cloning (1 984). 

When applied to a protein, the temn "isolated" indicates that the protein is found in a 
condition other than Its native environment. In a prefenred fomn, the isolated protein Is 

20 substantially free of other proteins, i.e. more than 95% pure, more preferably more than 99% 
pure. When applied to a polynucleotide molecule, the tenti "isolated" indicates that the 
molecule is removed from its natural genetic milieu, and is thus free of other extraneous or 
unwanted coding sequences, and is in a fonn suitable for use within genetically engineered 
protein production systems. Such isolated molecules are those that are separated from their 

25 natural environment and include cDNA and genomic clones. Isolated DNA molecules of the 
present invention are free of other genes with which they are ordinarily associated, and may 
include naturally occurring 5' and 3' untranslated regions such as promoters and terminators. 
The identification of associated regions will be evident to one of ordinary skill in the art (see for 
example, Dynan and Tijan, Nature 316: 774-78, 1985). 

30 

A " polynucleotide" i s a single- o r d ouble-stranded p olymer o f d eoxyribonucleotlde o r 
ribonucleotide bases read from tiie 5' to the 3' end. Polynucleotides include RNA and DNA. 
and may be isolated from natural sources, synthesized in vitro, or prepared from a combination 
of natural and synthetic molecules. 

35 

A "nucleic acid molecule" refers to ttie phosphate ester polymeric forni of 
ribonucteosides (adenosine, guanoslne, uridine or (^dlne; "RNA molecules") or 
deoxyribonucleosides (deoxyadenoslne, deoxyguanoslne, deoxythymidine, or deoxycytidine; 

2 
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"DMA molecules") In either single stranded forni, or a double-stranded helix. Double stranded 
DNA-DNA, D NA-RNA a nd R NA-RN A h slices a re p osslble. T he t enm n udelc a dd m olecule, 
and In particular DNA or RNA molecule, refers only to the prinrary and secondary structure of 
the molecule, and does not limit It to any particular tertiary or quaternary fonms. Thus, this tenm 

5 Indudes double-stranded DNA found, inter alia, in linear or drcular DNA molecules (e.g., 
restriction fragments), plasmlds, and chromosomes. In discussing the structure of particular 
double-stranded DNA molecules, sequences may be described herein according to the normal 
convention of giving only the sequence In the 5' to 3' direction along the non-transcribed strand 
of DNA (i.e., the strand having a sequence homologous to the mRNA). A "recombinant DNA 

10 molecule" is a DNA molecule that has undergone a molecular biological manipulation. 

A DNA "coding sequence" Is a double-stranded DNA sequence, which is transcribed 
and translated into a polypeptide In a cell In vitro or In vivo when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are determined by 

15 a start codon at the 5' (amino) terminus and a translation stop codon at the 3' (carboxyl) 
temiinus. A coding sequence can include, but is not limited to, prokaryotic sequences, cDNA 
from eukaryotlc mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and 
even synthetic DNA sequences. If the coding sequence is intended for expression in a 
eukaryotic ceil, a polyadenytation signal and transcription termination sequence will usually be 

20 located 3' to the coding sequence. 

A "gene" refers a nucleic acid sequence encoding a peptide, a polypeptide or a protein. 

An "Expression vector" is a DNA molecule, linear or circular, that comprises a segment 
25 encoding a polypeptide of Interest operabiy linked to additional segments that provide for its 
transcription. Such additional segments may include promoter and terminator sequences, and 
optionally one or more origins of replication, one or more selectable mari<ers, an enhancer, a 
polyadenyiation signal, and the \ike. Expression vectors are generally derived from plasmid or 
viral DNA, or may contain elements of both. 

30 

Transcriptional and translational control sequences are DNA regulatory sequences, 
such as promoters, enhancers, temiinators, and the like, that provide for the expression of a 
coding sequence in a host cell. In eukaryotic cells, polyadenyiation signals are control 
sequences. 

35 

A "secretory signal sequence" Is a DNA sequence that encodes a polypeptide (a 
"secretory peptide") that, as a component of a larger polypeptide, directs the larger polypeptide 
through a secretory pathway of a cell in which it is synthesized. The larger polypeptide is 

3 



wo 200S/005654 PCT/DK2004/00049S 

oommonly cleaved to remove the secretory peptide during transit through the secretory 
pathway. 



The term "promoter* Is used herein for its art-recognized meaning to denote a portion of 
5 a gene containing DNA sequences that provide for the binding of RNA polymerase and 
initiation of transcription. Promoter sequences are commonly, but not always, found In the 5' 
non-coding regions of genes. 

"Operabiy linlced'', when referring to DNA segments, indicates that the segments are 
10 anranged so that they function in concert for their intended purposes, e.g. transcription Initiates 
in the promoter and proceeds through the coding segment to the terminator. 

A coding sequence Is "under the control" of transcriptional and translational control 
sequences In a cell when RNA polymerase transcribes the coding sequence into mRNA, which 
15 is then trans-RNA spliced and translated into the protein encoded by the coding sequence. 

''Isolated polypeptide" is a polypeptide which Is essentially free of other non-[enzyme] 
polypeptides, e.g., at least about 20% pure, preferably at least about 40% pure, more 
preferably about 60% pure, even more preferably about 80% pure, most preferably about 90% 
20 pure, and even most preferably about 95% pure, as determined by SDS-PAGE. 

"Heterologous" DNA refers to DNA not naturally located In the cell, or in a chromosomal 
site of the cell. Preferably, the heterologous DNA includes a gene foreign to the cell. 

25 A cell has been "transfected" by exogenous or heterologous DNA when such DNA has 

been Introduced inside the cell. A cell has been "transformed" by exogenous or heterologous 
DNA when the transfected DNA effects a phenotypic change. Preferably, the transforming 
DNA should be integrated (covalently linked) into chromosomal DNA making up the genome of 
the ceil. 

30 

"Homologous recombination" refers to the 1 nsertion of a foreign D NA sequence of a 
vector in a chromosome. Preferably, the vector targets a specific chromosomal site for 
homologous recombination. For specific homologous recombination, the vector will contain 
sufficiently long regions of homology to sequences of the chromosome to allow complementary 
35 binding and incorporation of the vector into the chromosome. Longer regions of homology, and 
greater degrees of sequence similarity, may increase the efficiency of homologous 
recombination. 
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''Specific activity" of an enzyme is activity unit per milligram of enzyme protein. 



A ''library" is a collection of entitles having a common feature, e.g. a collection of 
nucleotide sequences encoding (different) enzymes. 

5 

Tlie term "randomized library" of protein variants refers to a library with at least partially 
randomized composition of the members, e.g. protein variants. 

The term "functionality" of protein variants refers to e.g. enzymatic activity, binding to a 
ilgand or receptor, stimulation of a cellular response (e.g. 3IH-thymidine incorporation as 
1 0 response to a mitogenlc factor), or anti-microbiai activity. 

By the term "specific polyclonal antibodies" Is meant polyclonal antibodies isolated 
according to their specificity for a certain antigen, e.g. the protein backbone. 

15 "Spliced mutagenesis" is a fomi of site-directed mutagenesis, in which the primers used 

have been synthesized using mixtures of oligonucleotides at one or more positions. 

Detailed description of the Invention 

20 The present Invention relates to a method for screening enzyme variants for improved 

specific activity. The specific activity Is altered by generating enzyme variants starting from a 
protein bacl<bone, typically an enzyme. The improved specific activity of the enzyme variant 
may either be higher or lower than the specific activity found in the parent enzyme, i.e. protein 
baclcbone that has been modified, depending on the application of the enzyme variant. 

25 Changes In specific activity of the generated enzyme variants are difficult to monitor 

without applying a purification step due to the presence of other proteins in the host cell culture 
supernatant. This problem has been solved in the present invention by constructing a fusion 
protein which consists of two enzymes. One of the enzymes in the fusion protein is the enzyme 
variant with changed specific activity, and the other enzyme is an enzyme with known specific 

30 activity (herein after the mariner enzyme). Choice of mariner enzyme depends on enzyme 
variant as the two enzymes preferably have no overlap in the analytical signal produced. 

The present Invention comprises the steps of: 

(I) generating a library of nucleic acid sequences encoding enzyme variants of 
35 interest 

(il) providing a nucleic acid sequence encoding a mariner enzyme to be fused with 
the enzyme in (I) 
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(HI) fusing nudeic acid sequence encoding enzyme variants in (I) with nucleic acid 
sequence encoding enzyme in (11) 

(iv) transfonning the fused nucleic sequence obtained In (ill) into a host cell 

(v) culturing host cell In (iv) in order to express the fused enzymes 

(vi) sampling each cell culture obtained In (v) 

(vll) analyzing samples obtained in (vi) by detemnining activity ratio of the expressed 
fused enzymes 

(vlli) selecting the samples exhibiting the desired activity ratio. 

Nucleic Acid Sequence 

The techniques used to Isolate or clone a nucleic acid sequence encoding a 
polypeptide are Icnown In the art and include isolation from genomic DNA, preparation from 
cDNA, or a combination thereof. The cloning of the nucleic acid sequences of the present 
Invention from such genomic DNA can be effected, e.g., by using the well known polymerase 
chain reaction (PGR) or antibody screening of expression libraries to detect cloned DNA 
fragments with shared structural features. See e.g. Innis et al.. 1990, A Guide to Methods and 
Application, Academic Press, New York. Other nucleic acid amplification procedures such as 
iigase chain reaction (LCR), llgated activated transcription (l-AT) and nucleic acid sequence- 
based amplification (NASBA) may be used. The nucleic acid sequence may be cloned from a 
strain producing the polypeptide, or from another related organism and thus, for example, may 
be an allelic or species variant of the polypeptide encoding region of the nucleic acid 
sequence. 

The term ''isolated" nucleic acid sequence as used herein refers to a nucleic acid 
sequence which is essentially free of other nucleic acid sequences, e.g., at least about 20% 
pure, preferably at least about 40% pure, more preferably about 60% pure, even more 
preferably about 80% pure, most preferably about 90% pure, and even most preferably about 
95% pure, as determined by agarose gel electorphoresis. For example, an isolated nucleic 
acid sequence can be obtained by standard cloning procedures used in genetic engineering to 
relocate the nucleic acid sequence from its natural location to a different site where it will be 
reproduced. The cloning procedures may involve excision and isolation of a desired nucleic 
acid fragment comprising the nucleic acid sequence encoding the polypeptide, insertion of the 
fragment into a vector molecule, and incorporation of the recombinant vector Into a host cell 
where multiple copies or clones of the nucleic acid sequence will be replicated. The nucleic 
acid sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any 
combinations thereof. 



Nucleic Add Construct 
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As used herein the term "rtudeic acid construcT is intended to indicate any nucleic acid 
molecule of cDNA. genomic DNA, synthetic DNA or RNA origin. The tenn "construcT Is 
Intended to indicate a nucleic add segment which may be single- or double-stranded, and 
which may be based on a complete or partial naturally occuning nudeotide sequence 
5 encoding a polypeptide of Interest The construct may optionally contain other nudeic acid 
segments. 

The DNA of interest may suitably be of genomic or cDNA origin, for instance obtained 
by preparing a genomic or cDNA library and screening for DNA sequences coding for all or 
10 part of the polypeptide by hybridization using synthetic ollgonudeotlde probes In accordance 
with standard techniques (cf. Sambrool( et al., supra). 

The nudeic acid construct may also be prepared synthetically by established standard 
methods, e.g. the phosphoamidlte method described by Beaucage and Caruthers. 
15 Tetrahedron Letters 22 (1981), 1859 - 1869, or the method described by i\/latthes et al., ElVIBO 
Journal 3 (1984). 801 - 805. According to the phosphoamidlte method, ollgonudeotides are 
synthesized, e.g. in an automatic DIMA synthesizer, purified, annealed, ligated and cloned In 
suitable vectors. 

20 Furthemnore, the nucleic add construct may be of mixed synthetic and genomic, mixed 

synthetic and cDNA or mixed genomic and cDNA origin prepared by llgating fragments of 
synthetic, genomic or cDi^ origin (as appropriate), the fragments con-esponding to various 
parts of the entire nucleic add construct, In accordance with standard techniques. 

25 The nudeic acid construct may also be prepared by polymerase chain reaction using 

specific primers, for instance as described In US 4.683.202 or Sail<i et al.. Sdence 239 (1988). 
487-491. 

The temn nudeic acid construct may be synonymous with the temn expression cassette 
30 when the nucleic acid construct contains all the control sequences required for expression of a 
coding sequence of the present invention. The term "coding sequence" as defined herein is a 
sequence which is transcribed into mRNA and translated into a polypeptide of the present 
Invention when placed under the control of the above mentioned control sequences. The 
boundaries of the coding sequence are generally detemnined by a translation start codon ATG 
35 at the 5'-temninus and a translation stop codon at the 3'-tenninus. A coding sequence can 
Indude, but is not limited to, DNA, cDNA, and recombinant nucleic acid sequences. 

The term "control sequences" is defined herein to include all components which are 
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necessary or advantageous for expression of the coding sequence of the nucleic add 
sequence. Each control sequence may be native or foreign to the nucleic acid sequence 
encoding the polypeptide. Such control sequences include, but are not limited to, a leader, a 
polyadenylation sequence, a propeptide sequence, a promoter, a signal sequence, and a 
transcription tenmlnator. At a minimum, the control sequences include a promoter, and 
transcriptional and transiatlonal stop signals. The control sequences may be provided with 
linkers for the purpose of introducing spedfic restriction sites fadiitating ligation of the control 
sequences with the coding region of the nudelc add sequence encoding a polypeptide. 

The control sequence may be an appropriate promoter sequence, a nucleic acid 
sequence which Is recognized by a host cell for expression of the nucleic add sequence. The 
promoter sequence contains transcription and translation control sequences which mediate the 
expression of the polypeptide. The promoter may be any nucleic acid sequence which shows 
transcriptional activity in the host cell of choice and may be obtained from genes encoding 
extracellular or intracellular polypeptides either homologous or heterologous to the host cell. 

The control sequence may also be a suitable transcription terminator sequence, a 
sequence recognized by a host cell to temninate transcription. The temninator sequence Is 
operably linked to the 3' tenninus of the nudelc acid sequence encoding the polypeptide. Any 
tenmlnator which is functional In the host cell of choice may be used in the present invention. 

The control sequence may also be a polyadenylation sequence, a sequence which is 
operably linked to the 3' temninus of the nudelc acid sequence and which, when transcribed. Is 
recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. 
Any polyadenylation sequence which is functional in the host cell of choice may be used In the 
present invention. 

The control sequence may also be a signal peptide coding region, which codes for an 
amino acid sequence linked to the amino terminus of the polypeptide which can direct the 
expressed polypeptide into the cell's secretory pathway of the host cell. The 5' end of the 
coding sequence of the nucleic acid sequence may inherently contain a signal peptide coding 
region naturally linked in translation reading frame with the segment of the coding region which 
encodes the secreted polypeptide. Alternatively, the 5' end of the coding sequence may 
contain a signal peptide coding region which Is foreign to that portion of the coding sequence 
which encodes the secreted polypeptide. A foreign signal peptide coding region may be 
required where the coding sequence does not normally contain a signal peptide coding region. 
Alternatively, the foreign signal peptide coding region may simply replace the natural signal 
peptide coding region in order to obtain enhanced secretion relative to the natural signal 
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peptide coding region nomially associated with the coding sequence. The signal peptide 
coding region may be obtained from a glucoamylase or an amylase gene from an Aspergillus 
species, a lipase or proteinase gene from a Rhizomucor species, the gene for the alpha-factor 
from Saccharomyces cerevislae, an amylase or a protease gene from a Bacillus species, or 
the calf preprochymosin gene. However, any signal peptide coding region capable of directing 
the expressed polypeptide Into the secretory pattiway of a host cell of choice may be used In 
tiie present invention. 

The control sequence may also be a propeptide coding region, which codes for an 
amino acid sequence positioned at the amino terminus of a polypeptide. The resultant 
polypeptide is known as a proenzyme or propolypeptide (or a zymogen In some cases). A 
propoiypeptide is generally inactive and can be converted to mature active polypeptide by 
catalytic or autocatalytic cleavage of the propeptide from the propoiypeptide. The propeptide 
coding region may be obtained from the Bacillus subtilis alkaline protease gene (aprE), the 
Bacillus subtilis neutral protease gene (nprT), the Saccharomyces cerevisiae alpha-factor 
gene, or the Mycellophthora thenmophilum laccase gene (WO 95/33836). 

The nucleic acid constructs of tine present invention may also comprise one or more 
nucleic acid sequences which encode one or more factors that are advantageous in the 
expression of the polypeptide, e.g., an activator (e.g., a trans-acting factor), a chaperone, and 
a processing protease. Any factor that is functional in the host cell of choice may be used in 
the present invention. The nucleic acids encoding one or more of these factors are not 
necessarily in tandem with the nucleic acid sequence encoding the polypeptide. 

An activator is a protein which activates transcription of a nucleic acid sequence 
encoding a polypeptide (Kudla et ai., 1 990, EMBO Journal 9:1355-1364; Jarai and Buxton, 
1994, Cun-ent Genetics 26:2238-244; Verdler, 1990, Yeast 6:271-297). The nucleic acid 
sequence encoding an activator may be obtained from the genes encoding Bacillus 
stearothemnophllus NprA (nprA), Saccharomyces cerevisiae heme activator protein 1 (hap1), 
Saccharomyces cerevisiae galactose metabolizing protein 4 (gal4), and Aspergillus nidulans 
ammonia regulation protein (areA). For further examples, see Verdler, 1990, supra and 
MacKenzie et ai.. 1993, Journal of General Microbiology 139:2295-2307. 

A chaperone is a protein which assists another polypeptide in folding properly (Harti et 
ai., 1994, TIBS 19:20-25; Bergeron et al., 1994, TIBS 19:124-128; Demolder et al., 1994, 
Journal of Biotechnology 32:179-189; Craig, 1993, Science 260:1902-1903; Gething and 
Sambrook, 1992, Nature 355:33-45; Puig and Gilbert, 1994, Joumal of Biological Chemistry 
269.7764-7771; Wang and Tsou, 1993, The FASEB Joumal 7:1515-11157; Robinson et al., 
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1994. Bio/Technology 1:381-384). The nucleic acid sequence encoding a chaperone may be 
obtained from the genes encoding Bacillus subtills OroE proteins, Aspergillus oryzae protein 
disulphide IsomerasSt Saccharomyces oerevisiae cainexin, Saccharomyces cerevlsiae 
BiP/GRP78, and Saccharomyces cerevlsiae IHspTO. For further examples, see Gething and 
Sambrook, 1992, supra, and HartI et a!., 1994. supra. 

A processing protease is a protease that cleaves a propeptide to generate a mature 
biochemically active polypeptide (Endertin and Ogrydzlalc. 1994. Yeast 10:67-79; Fuller et ah. 
1989, Proceedings of the National Academy of Sciences USA 86:1434-1438; Julius et at.. 
1984, Cell 37:1075-1089; Julius et at.. 1983. Cell 32:839-852). The nucleic acid sequence 
encoding a processing protease may be obtained from the genes encoding Aspergillus niger 
Kex2, Saccharomyces cerevisiae dipeptidylamlnopeptldase. Saccharomyces cerevlsiae Kex2. 
and Yarrowla tipolytica dibasic processing endoprotease (xpr6). 

it may also be desirable to add regulatory sequences which allow the regulation of the 
expression of the polypeptide relative to the growth of the host celt. Examples of regulatory 
systems are those which cause the expression of the gene to be tumed on or off In response 
to a chemical or physical stimulus, including the presence of a regulatory compound. 
Regulatory systems in proicaryotic systems would include the lac, tac, and trp operator 
systems, in yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the 
TAICA alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and the Aspergillus 
oryzae glucoamylase promoter may be used as regulatory sequences. Other examples of 
regulatory s equences a re t hose w hich a How f or g ene a mplification. I n e ukaryotic s ystems, 
these Include the dlhydrofolate reductase gene which Is amplified in the presence of 
methotrexate, and the metallothionein genes which are amplified with heavy metals. In these 
cases, the nucleic acid sequence encoding the polypeptide would be placed in tandem with the 
regulatory sequence. 

Nucleic acid sequence library 

Preparation of a nucleic acid sequence library can be achieved by use of known 
methods. 

Procedures for extracting genes from a cellular nucleotide source and preparing a gene 
library a re d escribed in e.g. Pitcher etal., "Rapid extraction of bacterial genomic DNA with 
guanidium thiocyanate", Lett. Appl. Microbiol., 8, pp 151-156, 1989, Dretzen, G. et ai., "A 
reliable method for the recovery of DNA fragments from agarose and acrylamide gels". Anal. 
Biochem., 112, pp 295-298, 1981, WO 94/19454 and Diderichsen et ai., "Cloning of aldB, 
which encodes alpha-acetolactate decarboxylase, an exoenzyme from Bacillus brevis". J. 
Bacteriol., 172, pp 4315-4321. 1990. 
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Procedures for preparing a gene library from an in vitro made synthetic nucleotide 
source can be found In (e.g. described by Stemmed Proc. Nati. Acad. Sd. USA» 91. pp. 
10747-10751. 1994 or WO 95/17413). 

5 Promoters 

Examples of suitable promoters for directing the transcription of the nucleic acid 
constructs of the present invention, especially In a bacterial host cell, are the promoters 
obtained from the E. coli lac operon. the Streptomyces coelicolor agarase gene (dagA). the 
Bacillus subtilis levansucrase gene (sacB). the Bacillus subtills alkaline protease gene, the 

10 Bacillus licheniformis alpha-amylase gene (amyL), the Bacillus stearothenmophilus maltogenic 
amylase gene (amyM). the Bacillus amyloliquefaciens alpha-amylase gene (amyQ). the 
Bacillus amyloliquefaciens BAN amylase gene, the Bacillus iichenlfonmls penicillinase gene 
(penP), the Bacillus subtilis xylA and xylB genes, and the prol<aryotic beta-lactamase gene 
(Vllla-Kamaroff et al., 1978. Proceedings of the National Academy of Sciences USA 75:3727- 

15 3731), as well as the tac promoter (DeBoer et al.. 1983, Proceedings of the National Academy 
of Sciences USA 80:21-25) , or the Bacillus pumilus xylosidase gene, or by the phage Lambda 
PR or PL promoters or the E. coli lac, trp or tac promoters. Further promoters are described In 
"Useful proteins from recombinant bacteria** In Scientific American. 1980, 242:74-94; and in 
Sambrooi^ et al.. 1989. supra. 

20 

Examples of suitable promoters for directing the transcription of the nucleic acid 
constructs of the present invention in a filamentous fungal host cell are promoters obtained 
from the genes encoding Aspergillus oryzae TAKA amylase, Rhizomucor miehel aspartic 
proteinase, Aspergillus niger neutral alpha amylase, Aspergillus niger acid stable alpha- 

25 amylase, A sperglllus n iger o r A spergillus a wamori glucoamylase ( glaA), R hizomucor m iehei 
lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, 
Aspergillus nidulans acetamidase, Fusarium oxysporum trypsin-like protease (as described in 
U.S. Patent No. 4,288,627, which is incorporated herein by reference), and hybrids thereof. 
Particularly preferred promoters for use in filamentous fungal host cells are the TAKA amylase, 

30 NA2-tpi (a hybrid of the promoters from the genes encoding Aspergillus niger neutral ( amylase 
and Aspergillus oryzae triose phosphate isomerase), and glaA promoters. Further suitable 
promoters for use in filamentous fungus host cells are the ADH3 promoter (McKnight et al., 
The EMBO J. 4 (1985). 2093 - 2099) or the tplA promoter. 

35 Examples of suitable promoters for use in yeast host cells Include promoters from yeast 

glycolytic genes (Hitzeman et al.. J. Biol. Chem. 255 (1980), 12073 - 12080; Aiber and 
Kawasaki. J. Mol. Appl. Gen. 1 (1982). 419 - 434) or alcohol dehydrogenase genes (Young et 
al.. in Genetic Engineering of Microorganisms for Chemicals (Hollaender et al. eds.). Plenum 
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Press. New York, 1982). or the TPI1 (US 4.599,311) or ADH2-4c (Russell et al.. Nature 304 
(1983), 652 - 654) promoters. 

Further useful promoters are obtained from the Saccharomyces cerevislae enolase 
5 (ENO-1) gene, the Saccharomyces cerevislae galactoklnase gene (GAL1), the 
Saccharomyces cerevislae alcohol dehydrogenase/glyceraldehyde-3-phosphate 
dehydrogenase genes (ADH2/GAP), and the Saccharomyces cerevislae 3-phosphoglycerate 
kinase gene. Other useful promoters for yeast host celts are described by Romanes et al.. 
1992, Yeast 8:423-488. In a mammalian host cell, useful promoters include viral promoters 
10 such as those from Simian Virus 40 (SV40), Rous sarcoma virus (RSV). adenovirus, and 
bovine papilloma virus (BPV). 

Examples of suitable promoters for directing the transcription of the DNA encoding the 
polypeptide of the invention in mammalian ceils are the SV40 promoter (Subramani et al., Mol. 
15 Ceil Biol. 1 (1981). 854 -864), the MT-1 (metallothionein gene) promoter (Palmlter et al.. 
Science 222 (1983), 809 • 814) or the adenovirus 2 major late promoter. 

An example of a suitable promoter for use in insect celts is the polyhedrin promoter (US 
4,745.051; Vasuvedan et at.. FEBS Utt 311. (1992) 7-11). the P10 promoter (J.M. Vlak et 
20 al.. J. Gen. Virology 69, 1988. pp. 765-776). the Autographa caiifornica polyhedrosis virus 
basic protein promoter (EP 397 485). the baculovims immediate early gene 1 promoter (US 
5.155,037; US 5,162,222), or the baculovims 39K delayed-early gene promoter (US 
5,155.037; US 5,162,222). 

25 Terminators 

Preferred terminators for filamentous fungal host celts are obtained from the genes 
encoding Aspergillus oryzae TAI<A amylase. Aspergillus niger glucoamylase, Aspergillus 
nidulans anthranltate synthase, Aspergiiius niger aipha-glucosidase. and Fusarium oxysporum 
trypsin-lil<e protease, for fungal hosts) the TPI1 (Aiber and Kawasatd. op. cit.) or ADH3 
30 (McKnight et at., op. cit.) tenninators. 

Prefenred tenninators for yeast host ceils are obtained from the genes encoding 
Saccharomyces cerevislae enolase. Saccharomyces cerevislae (^ochrome C (CYC1), or 
Saccharomyces cerevislae glyceraldehyde-3-phosphate dehydrogenase. Other useful 
35 tenminators for yeast host cells are described by Romanes et al.. 1992. supra. 

Poiyadenyiation Signals 

Preferred poiyadenyiation sequences for filamentous fungal host cells are obtained 
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from the genes encoding Aspergillus oryzae TAKA amylase, Aspergillus niger giucoamylase, 
Aspergillus nldulans anthranllate synthase, and Aspergillus niger alpha-gluoosidase. 
Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman, 
1995. Molecular Cellular Biology 15:5983-5990. 

Signal Sequences 

An effective signal peptide coding region for bacterial host ceils is the signal peptide 
coding region obtained from the maltogenlc amylase gene from Bacillus NCIB 11837. the 
Bacillus stearothermophilus alpha-amylase gene, the Bacillus lichenifonnis subtillsin gene, the 
Bacillus iichenifonnls beta-lactamase gene, the Bacillus stearothemiophilus neutral proteases 
genes (nprT. nprS. nprM). and the Bacillus subtilis PrsA gene. Further signal peptides are 
described by Simonen and Paiva. 1993, Microbiological Reviews 57:109-137. 

An effective signal peptide coding region for filamentous fungal host cells is the signal 
peptide coding region obtained from Aspergillus oryzae TAKA amylase gene, Aspergillus niger 
neutral amylase gene, the Rhizomucor miehei aspartic proteinase gene, the i-lumicola 
lanuginosa cellulase or lipase gene, or the Rhizomucor miehei lipase or protease gene, 
Aspergillus sp. amylase or giucoamylase. a gene encoding a Rhizomucor miehei lipase or 
protease. The signal peptide is preferably derived from a gene encoding A. oryzae TAKA 
amylase. A. niger neutral alfa-amyiase. A. niger acid-stable amylase, or A. niger giucoamylase. 

Useful signal peptides for yeast host ceils are obtained from the genes for 
Saccharomyces cerevlsiae a-factor and Saccharomyces cerevisiae invertase. Other useful 
signal peptide coding regions are described by Romanes et ai.. 1992. supra. 

For secretion from yeast cells, the secretory signal sequence may encode any signal 
peptide which ensures efficient direction of the expressed polypeptide into the secretory 
pathway of the cell. The signal peptide may be a naturally occuning signal peptide, or a 
functional pari: thereof, or it may be a synthetic peptide. Suitable signal peptides have been 
found to be the a-factor signal peptide (cf. US 4,870,008), the signal peptide of mouse salivary 
amylase (cf. O. Hagenbuchle et al.. Nature 289, 1981, pp. 643-648), a modified 
carijoxypeptidase signal peptide (cf. L.A. Vails et al., Cell 48, 1987, pp. 887-897), the yeast 
BAR1 signal peptide (cf. WO 87/02670). or the yeast aspartic protease 3 (YAP3) signal 
peptide (cf. M. Egei-Mitani et al.. Yeast 6, 1990, pp. 127-137). 

For e fficient s ecretion I n y east, a s equence e needing a I eader p eptide may a Iso b e 
inserted downstream of the signal sequence and uptream of the DNA sequence encoding the 
polypeptide. The function of the leader peptide is to allow the expressed polypeptide to be 
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directed from the endoplasmic reticulum to the Golgl apparatus and further to a secretory 
vesicle for secretion Into the culture medium (i.e. exportation of the polypeptide across the cell 
wall or at least through the cellular membrane Into the periptasmic space of the yeast cell). The 
leader peptide may be the yeast a-factor leader (the use of which is described In e.g. US 
4,546,082, EP 16 201, EP 123 294, EP 123 544 and EP 163 520). Alternatively, the leader 
peptide may be a synthetic leader peptide, which is to say a leader peptide not found in nature. 
Synthetic leader peptides may, for instance, be constmcted as described in WO 89/02463 or 
WO 92/1 1378. 

Expression Vectors 

The present invention also relates to recombinant expression vectors comprising a 
nucleic acid sequence of the present invention, a promoter, and transcriptional and 
translational stop signals. The various nucleic add and control sequences described above 
may be joined together to produce a recombinant expression vector which may include one or 
more convenient restriction sites to allow for insertion or substitution of the nucleic acid 
sequence encoding the polypeptide at such sites. Altematively, the nucleic acid sequence of 
the present invention may be expressed by inserting the nucleic acid sequence or a nucleic 
acid construct comprising the sequence into an appropriate vector for expression. In creating 
the expression vector, the coding sequence is located in the vector so that the coding 
sequence Is operably linked with the appropriate control sequences for expression, and 
possibly secretion. 

The recombinant expression vector may be any vector (e.g., a piasmid or virus) which 
can be conveniently subjected to recombinant DNA procedures and can bring about the 
expression of the nucleic acid sequence. The choice of the vector will typically depend on the 
compatibility of the vector with the host cell Into which the vector is to be introduced. The 
vectors may be linear or closed circular plasmids. The vector may be an autonomously 
replicating vector, i.e., a vector which exists as an extrachromosomal entity, the replication of 
which is independent of chromosomal replication, e.g., a plasmld, an extrachromosomal 
element, a minlchromosome, or an artificial chromosome. The vector may contain any means 
for assuring self-replication. Alternatively, the vector may be one which, when Introduced into 
the host cell, is Integrated Into the genome and replicated together with the chromosome(s) 
Into which It has been Integrated. The vector system may be a single vector or plasmld or two 
or more vectors or plasmids which together contain the total DNA to be introduced Into the 
genome of the host cell, or a transposon. 

The vectors of the present invention preferably contain one or more selectable markers 
which penult easy selection of transfonned ceils. A selectable marker Is a gene the product of 
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Which provides for blodde or viral resistance, resistance to heavy metals, prototrophy to 
auxotrophs, and the lil<e. Examples of bacterial selectable markers are the dal genes from 
Bacillus subtilis or Bacillus iichenlfomnis, or maricers which confer antibiotic resistance such as 
ampiciliin, kanamycin, chloramphenicol, tetracycline, neomycin, hygromycin or methotrexate 
resistance. 

Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and 
URA3. A selectable marker for use in a filamentous fungal host cell may be selected from the 
group including, but not limited to, amdS (acetamidase), argB (omithine arbamoyltransferase), 
bar (phosphinothricin aoetyltransferase), hygB (hygromycin phosphotransferase), niaD (nitrate 
reductase), pyrG (orotldlne-5'-phosphate decarboxylase), sC (sulfate adenyltransferase), trpC 
(anthranilate synthase), and glufosinate resistance markers, as well as equivalents from other 
species. Prefenred for use in an Aspergillus cell are the amdS and pyrG markers of Aspergillus 
nidulans or Aspergillus oryzae and the bar marker of Streptomyces hygroscopicus. 
Furthemnore, selection may be accomplished by co-transformation, e.g., as described In WO 
91/17243, where the selectable marker is on a separate vector. 

The vectors of the present invention preferably contain an element(s) that penmits 
stable integration of the vector into the host cell genome or autonomous replication of the 
vector in the cell independent of the genome of the ceil. 

The vectors of the present invention may be integrated Into the host cell genome when 
introduced into a host cell. For integration, the vector may rely on the nucleic acid sequence 
encoding the polypeptide or any other element of the vector for stable Integration of the vector 
Into the genome by homologous or nonhomologous recombination. Altematively, the vector 
may contain additional nucleic acid sequences for directing integration by homologous 
recombination into the genome of the host cell. The additional nucleic acid sequences enable 
the vector to be integrated into the host cell genome at a precise location(s) in the 
chromosome(s). To increase the likelihood of integration at a precise location, the 
integrationai elements should preferably contain a sufficient number of nucleic acids, such as 
100 to 1,500 base pairs, preferably 400 to 1,500 base pairs, and most preferably 800 to 1.500 
base pairs, which are highly homologous with the conresponding target sequence to enhance 
the probability of homologous recombination. The integrationai elements may be any 
sequence that is homologous with the target sequence in the genome of the host cell. 
Furthermore, the integrationai elements may be non-encoding or encoding nucleic acid 
sequences. On the other hand, the vector may be integrated Into the genome of the host ceil 
by non-homologous recombination. These nucleic acid sequences may be any sequence that 
is homologous with a target sequence In the genome of the host cell, and, furthermore, may be 
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For autonomous replication, the vectorm ay further comprise an origin of replication 
enabling the vector to replicate autonomously in the host cell In question. Examples of 
bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, 
PACYC177. PACYC184, pUB110, pE194, pTA1060, and pAMB1. Examples of origin of 
replications for use in a yeast host cell are the 2 micron origin of replication, the combination of 
CEN6 and ARS4, and the combination of CEN3 and ARS1. The origin of replication may be 
one having a mutation which mai<es its functioning temperature-sensitive in the host cell (see, 
e.g., Ehrlich, 1 978, Proceedings of the National Academy of Sciences USA 75:1433). 

More than one copy of a nucleic acid sequence encoding a polypeptide of the present 
invention may be inserted into the host cell to amplify expression of the nucleic acid sequence. 
Stable amplification of the nucleic acid sequence can be obtained by integrating at least one 
additional copy of the sequence Into the host ceil genome using methods well known in the art 
and selecting for transformants. 

The procedures used to ligate the elements described above to construct the 
recombinant expression vectors of the present invention are well l<nown to one skilled In the art 
(see, e.g., Sambrook et al., 1989, supra). 

Host Cells 

The present invention also relates to recombinant host ceils, comprising a nudeic acid 
sequence of the invention, which are advantageously used in the recombinant production of 
the polypeptides. The temn "host cell" encompasses any progeny of a parent cell which Is not 
identical to the parent cell due to mutations that occur during replication. 

The cell is preferably transformed with a vector comprising a nucleic acid sequence of 
the invention followed by integration of the vector into the host chromosome. Transfonmation" 
means Introducing a vector comprising a nucleic add sequence of the present invention into a 
host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating 
extra-chromosomal vector. Integration is generally considered to be an advantage as the 
nudeic a cid s equence i s m ore I ikely t o b e s tably m aintalned i n t he c ell. I ntegration of the 
vector into the host chromosome may occur by homologous or non-homologous recombination 
as described above. 

The choice of a host cell will to a large extent depend upon the gene encoding the 
polypeptide and its source. The host ceil may be a unicellular microorganism, e.g., a 
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prokaryote. or a non-unicellular microorganism, e.g., a eukaryote. Useful unicellular cells are 
bacterial cells such as gram positive bacteria including, but not limited to, a Bacillus cell, e.g., 
Bacillus aikalophilus. Bacillus amyloliquefaciens, Bacillus brevis. Bacillus drculans, Bacillus 
coagulans, Bacillus lautus, Bacillus ientus. Bacillus ilcheniformls. Bacillus megaterium. Bacillus 
stearothemnophilus. Bacillus subtilis, and Bacillus thuringlensis; or a Streptomyces cell, e.g., 
Streptomyces lividans or Streptomyces murinus, or gram negative bacteria suchi as E. coll and 
Pseudomonas sp. In a preferred embodiment, the bacterial host cell is a Bacillus Ientus, 
Bacillus lichenifonmis, Bacillus stearothemiophilus or Bacillus subtilis cell. The transfomnation 
of a bacterial host cell may, for instance, be effected by protoplast transfomnation (see, e.g., 
Chang and Cohen, 1979, Molecular General Genetics 168:111-115), by using competent ceils 
(see, e.g.. Young and Spizizin, 1961, Journal of Bacteriology 81:823-829, or Dubnar and 
Davidoff-Abelson, 1971, Journal of Molecular Biology 56:209-221), by electroporation (see, 
e.g., Shlgekawa and Dower, 1988, Biotechniques 6:742-751), or by conjugation (see, e.g., 
Koehler and Thome, 1987, Journal of Bacteriology 169:5771-5278). 

The host cell may be a eukaryote. In a prefenred embodiment, the host cell is a fungal 
cell. "Fungi" as used herein includes the phyla Ascomycota, Basidlomycota, Chytrldiomycota, 
and Zygomycota (as defined by Hawksworth et al.. In, Ainsworth and Btsby's Dictionary of The 
Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK) as well as the 
Oomycota (as dted in Hawksworth et al., 1995, supra, page 171) and all mitosporic fungi 
(Hawksworth et al., 1995, supra). Representative groups of Ascomycota include, e.g., 
Neurospora, Eupenicillium (=Peniclllium), Emericella (=Aspergillus), Eurotium (=Aspergillus), 
and the true yeasts listed above. Examples of Basidlomycota include mushrooms, rusts, and 
smuts. Representative groups of Chytrldiomycota include, e.g., Allomyces, Blastocladiella, 
Coelomomyces, and aquatic fungi. Representative groups of Oomycota include, e.g., 
Saprolegniomycetous aquatic fungi (water molds) such as Achlya. Examples of mitosporic 
fungi include Aspergillus, Penicillium, Candida, and Altemaria. Representative groups of 
Zygomycota include, e.g., Rhizopus and Mucor. 

in a prefen-ed embodiment, the fungal host cell is a yeast cell. "Yeasf as used herein 
includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast 
belonging to the Fungi ImperfectI (Blastomycetes). The ascosporogenous yeasts are divided 
into t he families S permophthoraceae a nd S accharomycetaceae. The I atter i s comprised of 
four subfamilies, Schizosaccharomycoideae (e.g., genus Schizosaccharomyces), 
Nadsonioideae, Lipomycoideae, and Saccharomycoldeae (e.g., genera Pichia, Kiuyveromyces 
and Saccharomyces). The basidiosporogenous yeasts Include the genera Leucosporidim, 
Rhodosporidium, Sporidiobolus, Filobasidium, and Filobasidiella. Yeast belonging to the Fungi 
Imperfect! are divided into two families, Sporobolomycetaceae (e.g., genera Sorobolomyces 
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and Bullera) and C ryptocx>ccac6ae (e.g.* genus Candida). S ince the classification of yeast 
may cliange In the future, for the purposes of this Invention, yeast shall be defined as 
described In Biology and Activities of Yeast (Skinner, F.A., Passmore, S.M., and Davenport, 
R.R., eds. Soc. App. Bacterid. Symposium Series No. 9, 1980. The biology of yeast and 
manipulation of yeast genetics are well known In the art (see, e.g., Biochemistry and Genetics 
of Yeast, Bacil, M., Horecker, B.J., and Stopani, A.O.M., editors, 2nd edition, 1987; The 
Yeasts, Rose, A.H., and Harrison, J.S., editors, 2nd edition, 1987; and The Molecular Biology 
of the Yeast Saccharomyces, Strathem et al., editors, 1981). 

The yeast host cell may be selected from a cell of a species of Candida, 
Kluyveromyces, Saccharomyces, Schlzosaccharomyces, Candida, Pichia, Hansehula, or 
Yanrowla. In a preferred embodiment, the yeast host cell is a Saccharomyces carlsbergensis, 
Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasli, 
Saccharomyces kluyveri, Saccharomyces norbensis or Saccharomyces oviformis cell. Other 
useful yeast host cells are a Kluyveromyces lactts Kluyveromyces fragilis IHansehula 
polymorpha, Pichia pastorls Yarrowia llpolytica, Schlzosaccharomyces pombe, Ustilgo maylis, 
Candida maltose, Pichia guillemnondii and Pichia methanolio cell (of. Gleeson et al., J. Gen. 
Microbiol. 132, 1986, pp. 3459-3465; US 4,882,279 and US 4,879,231). 

In a preferred embodiment, the fungal host cell is a filamentous fungal cell. Filamentous 
fungi" include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by 
Hawksworth et al., 1995, supra). The filamentous fungi are characterized by a vegetative 
mycelium composed of chitin, cellulose, glucan, chltosan, mannan, and other complex 
polysaccharides. Vegetative growth is by hyphal elongation and carbon catabollsm is 
obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces 
cerevisiae Is by budding of a unicellular thallus and carbon catabollsm may be fermentative. In 
a more preferred embodiment, the filamentous fungal host cell is a cell of a species of, but not 
limited to, Acremonium, Aspergillus, Fusarlum, Humicola, Mucor, fVlyceliophthora, Neurospora, 
Penlcilllum, Thlelavla, Tolypocladlum, and Trichodenma or a teleomorph or synonym thereof. 
In an even more preferred embodiment, the filamentous fungal host cell is an Aspergillus cell. 
In another even more prefenred embodiment, the filamentous fungal host cell Is an 
Acremonium cell. In another even more preferred embodiment, the filamentous fungal host 
cell is a Fusarlum cell. In another even more preferred embodiment, the filamentous fungal 
host cell is a Humicola cell. In another even more preferred embodiment, the filamentous 
fungal host cell Is a Mucor cell. In another even more preferred embodiment, the filamentous 
fungal host cell is a Mycellophthora cell. In another even more prefenred embodiment, the 
filamentous fungal host cell Is a Neurospora cell. In another even more preferred embodiment, 
the filamentous fungal host cell is a Peniclllium cell. In another even more preferred 
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embodiment, the filamentous fiingai tiost cell Is a Thielavia cell. In another even more 
preferred emtxxiiment. the filamentous fungal host cell is a Tolypocladlum cell. I n another 
even more preferred emt)odiment. the filamentous fungal host cell is a Trichodemna cell, in a 
most prefen^ embodiment, the filamentous fungal host cell is an Aspergillus awamori, 

5 Aspergillus foetidus, Aspergillus Japonlcus, Aspergillus niger. Aspergillus nidulans or 
Aspergillus oryzae cell. In another most preferred embodiment, the filamentous fungal host 
cell is a Fusarium cell of the section Discolor (also known as the section Fusarium). For 
example, the filamentous fungal parent cell may be a Fusarium bactridioides, Fusarium 
cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium gramlnearum, Fusarium 

10 graminum, Fusarium heterosporum, Fusarium negundi, Fusarium reticulatum, Fusarium 
roseum, Fusarium sambuclnum, Fusarium sarcochroum, Fusarium sulphureum, or Fusarium 
trichothecloides cell. In another prefered embodiment, the filamentous fungal parent cell is a 
Fusarium strain of the section Elegans. e.g., Fusarium oxysporum. In another most prefen^ 
embodiment, the filamentous fungal host cell Is a Humicola insolens or Humicola lanuginosa 

15 cell. In another most prefenBd embodiment, the filamentous fungal host cell is a Mucor miehei 
cell. In another most prefen-ed embodiment, the filamentous fungal host cell is a 
Mycellophthora thenmophiium cell. In another most prefen-ed embodiment, the filamentous 
fungal host ceil is a Neurospora crassa cell. In another most prefenred embodiment, the 
filamentous fungal host cell is a PenlcilHum purpurogenum ceil. In another most preferred 

20 embodiment, t he filamentous fungal h ost c ell 1 s a T hlelavia t errestris c ell o r a A cremonium 
chrysogenum cell. In another most prefenred embodiment, the Trichoderma cell is a 
Trichoderma harzianum, Trichodemia konlngii. Trichodenma longibrachlatum, Trichodemia 
reesei or Trichodemna viride cell. The use of Aspergillus spp. for the expression of proteins Is 
described In. e.g.. EP 272 277. EP 230 023. 

25 

Transformation 

Fungal cells may be transfomied by a process involving protoplast formation, 
transfomnation of the protoplasts, and regeneration of the cell wall In a manner known per se. 
Suitable procedures for transformation of Aspergillus host cells are described In EP 238 023 

30 and Yelton et al., 1984, Proceedings of the National Academy of Sciences USA 81:1470-1474. 
A suitable method of transfonning Fusarium species is described by Malardier et al.. 1989, 
Gene 78:147-156 or In copending US Serial No. 08/269,449. Examples of other fungal cells 
are cells of filamentous fungi, e.g. Aspergillus spp.. Neurospora spp.. Fusarium spp. or 
Trichoderma spp.. in particular strains of A. oryzae. A. nidulans or A. nIger. The use of 

35 Aspergillus spp. for the expression of proteins is described in. e.g.. EP 272 277. and EP 230 
023. The transformation of F. oxysporum may. for Instance, be canried out as described by 
Malardier etal.. 1989, Gene 78: 147-156. 
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Yeast may be transformed using the procedures described by Becker and Guarente, In 
Abelson, J.N. and Simon, M.I.. editors, Guide to Yeast Genetics and Molecular Biology. 
Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et al., 
1983, Journal of Bacteriology 153:163; and HInnen et al., 1978, Proceedings of the National 
5 Academy of Sciences USA 75:1920. Mammalian cells may be transfomied by direct uptake 
using the calcium phosphate precipitation method of Graham and Van der Eb (1978, Virology 
52:546). 

Manipulating the nucleic acid sequences of a library 
10 In a particular embodiment the genes of a gene library may before, during or after 

initiating the screening be subjected to alterations and or mutations by genetic engineering. 
Generation of libraries of genes encoding variants of enzymes can be done in a variety of 
ways: 

15 (1) Error prone PGR employs a low fidelity replication step to Introduce random point 
mutations at each round of amplification (Caldwell and Joyce (1992), PGR Methods and 
Applications vol.2 (1), pp.28-33). Error-prone PGR mutagenesis is perfonned using a plasmid 
encoding the wild-type, i.e. wt, gene of interest as template to amplify this gene with flanking 
primers under PGR conditions where Increased enror rates leads to introduction of random 

20 point mutations. The PGR conditions utilized are typically: 10 mM Tris-HGI, pH 8.3, 50 mM 
KGI, 4 mM MgGI2, 0.3 mM MnGI2, 0.1 mM dGTP/dATP, 0.5 mM dTTP/dGTP, and 2.5 u Taq 
polymerase per 100 micro L of reaction. The resultant PGR fragment Is purified on a gel and 
cloned using standard molecular biology techniques. 

25 (2) Oligonucleotide directed mutagenesis in single codon position (Including deletions or 
insertions), e.g. by SOE-PGR is described by Klrchhoff and Desroslers, PGR Methods and 
Applications, 1993, 2, 301-304. This method is performed as follows: Two Independent PGR 
reactions are performed with 2 internal, overlapping primers, wherein one or both contain a 
mutant sequence and 2 extemal primers, which may encode restriction sites, thereby creating 

30 2 overiapping PGR fragments. These PGR fragments are purified, diluted, and mixed In molar 
ratio 1:1. The full length PGR product Is subsequently obtained by PGR amplification with the 
extemal primers. The PGR fragment is purified on gel and cloned using standard molecular 
biology techniques. 

35 (3) Oligonucleotide directed randomization in single codon position, such as saturation 
mutagenesis, may be done e.g. by SOE-PGR as described above, but using primers with 
randomized nucleotides. For example NN(G/T), wherein N is any of tiie 4 bases G,A,T or G, 
will yield a mixture of codons encoding all possible amino acids. 
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(4) Combinatorial site-directed mutagenesis libraries may be employed, where several oodons 
can be mutated at once using (2) and (3) above. For multiple sites, several overlapping PGR 
fragments are assembled simultaneously in a SOE-PCR setup. 

5 

(5) Anotiier protocol employs synthetic gene libraries preparation. Wild type. i.e. wt. genes can 
be assembled from multiple overiapping oligonucleotides (typically 40-100 nucleotides in 
length; ( Stemmer e t a I., ( 1 995). G ene 1 64, 4 9-53). B y i ncluding m Ixtures of w t a nd m utant 
variants of the same ollgo at various positions in the gene, the resulting assembled gene will 

1 0 contain mutations at various positions with mutagenic rates corresponding to the ratios of wt to 
mutant primers. 

(6) Still another method employs multiple mutagenic primers to generate libraries with multiple 
mutated positions. First an uracll-contalning nucleotide template encoding a polypeptide of 

15 interest is generated and 2-50 mutagenic primers con-espondlng to at least one region of 
Identity in ttie nucleotide template are synttiezised so ttiat each mutagenic primer comprises at 
least one substitution of the template sequence (on insertion/deietlon of bases) resulting in at 
least one amino acid substitution (or Insertion/deletion) of the amino acid sequence encoded 
by the uracil-containing nucleotide template. The mutagenic primers are then contacted wiUi 

20 ttie uracll-contalning nucleotide template under conditions wherein a mutagenic primer anneals 
to the template sequence. This Is followed by extension of the primer(s) catalyzed by a 
polymerase to generate a mixture of mutagenized polynucleotides and uracll-contalning 
templates. Finally, a host cell is transfomned witii the polynucleotide and template mixture 
wherein the template is degraded and tiie mutagenized polynucleotide replicated, generating a 

25 library of polynucleotide variants of ttie gene of interest. 

(7) Libraries may be created by shuffling e.g. by recombination of two or more wt genes or 
genes encoding variant proteins created by any combination of metiiods (1)-(6) (above) by 
DMA shuffling. 

30 Fusion protein 

Fusion protein consists of two proteins which are connected, possibly by a linker 
peptide. The fusion protein has two functions originated from each protein (e. g. en^me 
activity, antl-mlcroblai activity). The two various nucleic acid sequence of two proteins may be 
Joined togetiier witti a linker nucleic acid sequence by PGR technique, ligation or In vivo 
35 recombination. 
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Polypeptide linker 

The fusion protein of the present invention preferably contains one polypeptide linl<er 
which gives the proper flexibility to permit both proteins' activity expression. The linker 
sequence may be any linker which can connect two proteins covalentiy. The length of the 
5 linker depends on the target protein itself (e.g. stability, hydrophoblclty). Examples of linkers, 
but not limited to these linkers, are Poly-Arg, Poly-Hls, PEPTPEPT, FLAG, Strep-tag II, o^nyc, 
S-, HAT-, 3xFLAG, Calmoludln-blnding peptide, Cellulose-blndIng domain, SBP, Chltln-binding 
domain, Glutathione S-transferase, Maltose-binding domain (see Terpe, K., 2003, Applied 
Microbiology and Biotechnok>gy. 60(5):523-533). 

10 

Methods of Production 

The transfomied or transfected host cells described above are cultured in a suitable 
nutrient medium under conditions pemiitting the production of the desired molecules, after 
which these are recovered from the cells, or the culture broth. 

15 

The medium used to culture the cells may be any conventional medium suitable for 
growing the host cells, such as minimal or complex media containing appropriate supplements. 
Suitable media are available from commercial suppliers or may be prepared according to 
published recipes (e.g. in catalogues of the American Type Culture Collection). The media are 
20 prepared using procedures known in the art (see, e.g., references for bacteria and yeast; 
Bennett, J.W. and LaSure, L.. editors, More Gene Manipulations in Fungi, Academic Press. 
CA. 1991). 

The cells may be cultured in any suitable container-unit, e.g. a shake flask, 24 well 
25 plates, 96 well plates. 384 well plates. 1536 well plates, or a higher number of wells per plate, 
or nanoliter well-less compartments. 

in order to increase the number of individual activity assays perfomned In a given time 
the activity may conveniently be assayed in a high-throughput screening system using 96 well 
30 plates, 384 well plates, 1536 well plates, or a higher number of wells per plate, or nanoliter 
well-less compartments. Such screening techniques are well known in the art, see e.g. Dove, 
A., Nature Biotechnology (17), 1999. 859-863. and Keii. D.. trends in Biotechnology (17), 1999, 
89-91. 

35 If the molecules are secreted into the nutrient medium, they can be recovered directly 

from the medium, if they are not secreted, they can be recovered from cell lysates. The 
molecules are recovered from the culture medium by conventional procedures including 
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separating the host celts from the medium by centrifugation or filtration, precipitating the 
proteinaceous components of the supernatant or filtrate by means of a salt. e.g. ammonium 
sulphate, purification by a variety of chromatographic procedures, e.g. Ion exchange 
chromatography, gelfiltratlon chromatography, affinity chromatography, or the Wke, dependent 
5 on the type of molecule In question. 

The molecules of interest may be detected using methods itnown In the art that are 
specific for the molecules. These detection methods may include use of specific antibodies, 
formation of a product, or disappearance of a substrate. For example, an enzyme assay may 
10 be used to determine the activity of the molecule. Procedures for detemiining various Idnds of 
activity are Itnown In the art 

The molecules of the present invention may be purified by a variety of procedures 
known In the art including, but not limited to, chromatography (e.g.. Ion exchange, affinity, 
15 hydrophobic, chromatofocuslng. and size exclusion), electrophoretic procedures (e.g., 
preparative isoelectric focusing (lEF). differential solubility (e.g., ammonium sulfate 
precipitation), or extraction (see, e.g.. Protein Purification. J-C Janson and Lars Ryden, editors. 
VCH Publishers, New York, 1989). 

20 The terms "relevant protein backbone" or "protein backbone" refer to the polypeptide to 

be modified by creating a library of divereified mutants. The "relevant protein backbone" may 
be a naturally occurring (or wild-type) polypeptide or it may be a variant thereof prepared by 
any suitable means. For instance, the "relevant protein backbone" may be a variant of a 
naturally occurring polypeptide which has been modified by substitution, deletion or truncation 

25 of one or more amino add residues or by addition or Insertion of one or more amino acid 
residues to the amino acid sequence of a naturaiiy-occuning polypeptide. 

in the present Invention the enzyme to be varied as well as the marlcer enzyme may be 
selected from the group of enzymes comprising glycosyi hydrolases, carbohydrases, 
30 peroxidases, proteases, lipases, phytases, polysaccharide lyases, oxidoreductases, transglu- 
taminases and glycoseisomerases, In particular the following. 

Parent Proteases 

Parent proteases (i.e. enzymes classified under the Enzyme Classification number E.G. 3.4 in 
35 accordance with the Recommendations (1992) of the International Union of Biochemistry and 
iVIolecular Biology (iUBi\/IB)) Include proteases within this group. 

Examples include proteases selected from those classified under the Enzyme Ciassificatk>n 
(E.G.) numbers: 
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3.4.11 (i.e. so-called aminopeptidases), Including 3.4.11.5 (Prolyl aminopeptidase). 3.4.11.9 
(X-pro aminopeptidase). 3.4.11.10 (Bacterial leucyl aminopeptidase), 3.4.11.12 (Themnophillc 
aminopeptidase), 3.4.11.15 (Lysyl aminopeptidase). 3.4.11.17 (Tryptophanyl aminopeptidase). 
3.4.11.18 (Mettiionyl aminopeptidase). 
5 3.4.21 (i.e. so-called serine endopeptidases). including 3.4.21.1 (Chymotrypsin), 3.4.21.4 
(Trypsin), 3.4.21.25 (Cucumlsln). 3.4.21.32 (Brachyurin), 3.4.21.48 (Cerevlsin) and 3.4.21.62 
(Subtilisin): 

3.4.22 (i.e. so-called cysteine endopeptidases), including 3.4.22.2 (Papain), 3.4.22.3 (Ficain), 
3.4.22.6 (Ciiymopapaln). 3.4.22.7 (Asdepain). 3.4.22.14 (AcUnldain), 3.4.22.30 (Caricain) and 

10 3.4.22.31 (Ananain); 

3.4.23 (i.e. so-called aspartic endopeptidases), including 3.4.23.1 (Pepsin A), 3.4.23.18 
(Aspergillopepsin 1), 3.4.23.20 (Peniclllopepsin) and 3.4.23.25 (Saccliaropepsin); and 

3.4.24 (i.e. so-called metailoendopeptidases). Including 3.4.24.28 (Bacillolysin). 

Examples of relevant subtiiisins comprise subtilisin BPN*. subtilisin amylosacchariticus, 
15 subtilisin 168, subtilisin mesentericopeptldase. subtilisin Carlsberg, subtilisin DY, subtilisin 309. 
subtilisin 147, tiiermitase, aqualysln. Bacillus PB92 protease, proteinase K, Protease TW7, 
and Protease TW3. 

Specific examples of sucli readily available commercial proteases include Esperase®, 
Aicalase®, Neutrase®, Dyrazym®, Savinase®, Pyrase®. Pancreatic Trypsin NOVO (PTN), 
20 Bio-Feed® Pro, Clear-Lens Pro ® (all enzymes available from Novozymes A/S). 

Examples of other commercial proteases include Maxtase®, Maxacal®, Maxapem® marketed 
by Gist-Brocades N.V., Opticlean® marl<eted by Solvay et Cie. and Purafect® marlceted by 
Genencor international. 

It is to be understood that also protease variants are contemplated as the parent protease. 
25 Examples of such protease variants are disclosed In EP 130.756 (Genentech), EP 214.435 

(Henkel), WO 87/04461 (Amgen), WO 87/05050 (Genex). EP 251.446 (Genencor), EP 

260.105 (Genencor), Thomas et al., (1985), Nature. 318. p. 375-376, Thomas et al., (1987), J. 

Mo\. Biol.. 1 93, pp. 803-813. Russel etal., (1987), Nature, 328, p. 496-500, WO 88/08028 

(Genex), WO 88/08033 (Amgen), WO 89/06279 (Novo Nordisk A/S), WO 91/00345 (Novo 
30 Nordisk A/S), EP 525 61 0 (Solvay) and WO 94/0261 8 (Gist-Brocades N.V.). 

The activity of proteases can be detemnined as described in "iViethods of Enzymatic Analysis". 

third edition. 1984. Veriag Chemie, Weinheim, vol. 5. 

Parent Lipases 

35 Parent lipases (i.e. enzymes classified under the Enzyme Classification number E.C. 3.1.1 
(Cartjoxylic Ester Hydrolases) in accordance with the Recommendations (1992) of the Interna- 
tional Union of Biochemistry and Molecular Biology (lUBMB)) Include lipases within this group. 
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Examples include lipases selected from those classified under the Enzyme Classification 
(E.G.) numbers: 

3.1.1 (i.e. so-called Carboxylic Ester Hydrolases), including (3.1.1.3) Triacylglycerol lipases. 
(3.1.1 .4.) Phosphorlipase A2. 
5 Examples of lipases include lipases derived from the following microorganisms: 

Humlcola. e.g. H. brevispora. H. lanuginosa. H. brsvis var. thermoldea and H. insolens (US 
4,810,414). 

Pseudomonas, e.g. Ps. tragi. Ps. stutzerl, Ps. cepacia and Ps. fluorescens (WO 89/04361). or 

Ps. plantarii or Ps. gladioli (US patent no. 4,950.417 (Solvay enzymes)) or Ps. aicaligenes and 
10 Ps. pseudoaicaligenes (EP 218 272) or Ps. mendodna (WO 88/09367; US 5.389.536). 

Fusarium, e.g. F. oxyspomm (EP 130,064) or F. solani pisi(WO 90/09446). 

Mucor (also called Rhizomucor), e.g. M. miehei (EP 238 023). 

Chromobacterium (especially C. viscosum). Aspergillus (especially A. niger). 

Candida, e.g. C. cyllndracea (also called C. rugosa) or C. antarctica (WO 88/02775) or C. 
15 antarctica lipase A or B (WO 94/01541 and WO 89/02916). 

Geotricum, e.g. G. candidum (Schlmada etal., (1989), J. Blochem.. 106, 383-388). 

Penlcilllum, e.g. P. camembertii (Yamaguchl et al., (1991), Gene 103, 61-67). 

Rhizopus, e.g. R. delemar (Mass et al., (1991). Gene 109, 107-113) or R. nlveus (Kuglmlya et 

ai., (1992) Bloscl.Biotech. Biochem 56, 716-719) or R. oryzae. 
20 Badilus, e.g. B. subtllis (Dartois et al., (1 993) Biochemica et Blophyslca acta 1131. 253-260) or 

B. stearothemiophilus (JP 64/7744992) or B. pumllus (WO 91/16422). 

Specific examples of readily available commercial lipases Include LIpoiase®, Lipolase® Ultra. 
Llpozyme®, Palatase®, Novozym® 435, Lecltase® (ail available from Novozymes A/S). 
25 Examples of other lipases are Lumafast®, Ps. mendocian lipase from Genencor Int Inc.; 
Lipomax®, Ps. pseudoaicaligenes lipase from Gist Brocades/Genencor Int Inc.; Fusarium 
solani lipase (cutinase) from Unilever, Bacillus sp. lipase from Solvay enzymes. Other lipases 
are available from other companies. 

it is to be understood that also lipase variants are contemplated as the parent enzyme. 
30 Examples of such are described In e.g. WO 93/01285 and WO 95/2261 5. 

The activity of the lipase can be detennined as described In "Methods of Enzymatic Analysis", 
Third Edition, 1984, Veriag Chemie, Weinheln. vol. 4, or as described In AF 95/5 GB (available 
on request from Novozymes A/S). 

35 Parent Oxidoreductases 

Parent oxidoreductases (i.e. en^mes classified under the Enzyme Classification number E.C. 
1 (Oxidoreductases) In accordance with the Recommendations (1992) of the International 
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Union of Biochemistry and Molecular Biology (lUBMB)) indude oxidoreductases within this 
group. 

Examples include oxidoreductases selected from those classified under the Enzyme Classi- 
fication (E.G.) numbers: 

Glycerol-3.phosphate dehydrogenase _NAD+_ (1.1.1 .8), Glycerol-3-phosphate 
dehydrogenase _NAD(P)+_ (1.1.1.94), Glycerol-3-phosphate 1 -dehydrogenase _NADP. 
(1.1.1.94). Glucose oxidase (1.1.3.4), Hexose oxidase (1.1.3.5), Catechol oxidase (1.1.3.14), 
Bilirubin oxidase (1.3.3.5), Alanine dehydrogenase (1.4.1.1), Glutamate dehydrogenase 
(1.4.1.2), Glutamate dehydrogenase _NAD(P)+_ (1.4.1.3), Glutamate dehydrogenase 
_NADP+_ (1.4.1.4), L-Amino acid dehydrogenase (1.4.1.5), Serine dehydrogenase (1.4.1,7). 
Valine dehydrogenase _NADP+_ (1.4.1.8), Leucine dehydrogenase (1.4.1.9), Glycine 
dehydrogenase (1.4.1.10), L-Amino-acid oxidase (1.4.3.2.), D-Amino-acid oxldase(1 .4.3.3), L- 
Giutamate oxidase (1.4.3.11), Protein-lysine 6-oxldase (1.4.3.13), L-lyslne oxidase (1.4.3.14), 
L-Aspartate oxidase (1.4.3.16), D-amlno-add dehydrogenase (1.4.99.1), Protein disulfide 
reductase (1.6.4.4), Thioredoxin reductase (1.6.4.5), Protein disulfide reductase (glutathione) 
(1.8.4.2), Laccase (1.10.3.2), Catalase (1.11.1.6), Peroxidase (1.11.1.7), Lipoxygenase 
(1.13.11.12), Superoxide dismutase (1.15.1.1) 

Said Glucose oxidases may be derived from Aspergillus niger. Said Laccases may be derived 
from Polyporus pinsitus, Myceliophtora thenmophila, Coprinus dnereus, Rhizoctonia solani, 
Rhizoctonia pratlcola, Scytalidium thermophilum and Rhus vemicifera. Bilirubin oxidases may 
be derived from Myrothechedum verrucaria. The Peroxidase may be derived from e.g. Soy 
bean. Horseradish or Coprinus cinereus. The Protein Disulfide reductases Protein Disulfide 
reductases of bovine origin, Protein Disulfide reductases derived from Aspergillus oryzae or 
Aspergillus niger, and DsbA or DsbC derived from Escherichia coll. 

Specific examples of readily available commercial oxidoreductases include Gluzyme (enzyme 
available from Novozymes A/S). However, other oxidoreductases are available from others. 
It is to be understood that also variants of oxidoreductases are contemplated as the parent 
enzyme. 

The activity of oxidoreductases can be determined as described In "Methods of Enzymatic 
Analysis", third edition, 1984, Veriag Chemie, Weinheim, vol. 3. 

Parent Carbohydrases 

Parent carbohydrases may be defined as all enzymes capable of breaking down carit)ohydrate 
chains (e.g. starches) of especially five and six member ring structures (i.e. enzymes classified 
under the Enzyme Classification number E.C. 3.2 (glycosidases) In accordance with the 
Recommendations (1992) of the I ntemationai Union of Biochemistry and Molecular Biology 
(lUBMB)). 
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Examples include carbohydrases selected from those classified under the En^e Classi- 
fication (E.C.) numt}ers: 

alfa-amylase (3.2.1.1) alfa-amylase (3.2.1.2), glucan 1.4-alfa-glucosida8e (3.2.1.3). cellulase 
(3.2.1.4), endo-1,3(4)-ljeta-glucanase (3.2.1.6), endo-1,4-beta-xylanase (3.2.1.8), dextranase 

5 (3.2.1.11), chitinase (3.2.1.14), polygalacturonase (3.2.1.15), iysozyme (3.2.1.17), beta- 
glucosidase (3.2.1.21). alfa-gaiactosldase (3.2.1.22), beta-galactosldase (3.2.1.23), amylo- 
1.6-glucosIdase (3.2.1.33), xylan 1,4-beta-xylosidase (3.2.1.37), glucan endo-1,3-beta-D- 
glucosldase (3.2.1.39), aifa-dextrin endo-1,6-glucoslda8e (3.2.1.41), sucrose alfa-glucosldase 
(3.2.1.48), glucan endo-1 ,3-alfa-glucosidase (3.2.1.59), glucan 1,4-beta-gluco8ldase 

10 (3.2.1 .74), glucan endo-1 ,6-beta-glucosIdase (3.2.1 .75), arabinan endo-1 .5-alfa-arablnosidase 
(3.2.1.99), lactase (3.2.1.108), and chltonanase (3.2.1.132). 

Specific examples of readily available commercial carbohydrases Include Alpha-Gal®, Bio- 
Feed® Alpha, Bio-Feed® Beta. Bio-Feed® Plus. Bio-Feed® Plus. Novozyme® 188, 
Carezyme®, Celluclast®. Cellusofl®, Ceremyl®, CItrozym®, Denlmax®, Dezyme®, 

15 Dextrozyme®, FInlzym®, Fungamyl®, Gamanase®, Glucanex®, Lactozym®, Maltogenase®, 
Pentopan®, Pectinex®, Promozyme®, Pulpzyme®. Novamyl®, Termamyl®, AMQ 
(Amyloglucosidase Novo). Maltogenase®. Aquazym®. Natalase® (all enzymes available from 
Novozymes A/S). Other carbohydrases are available from other companies. 
It Is to be understood that also carbohydrase variants are contemplated as the parent enzyme. 

20 The activity of carbohydrases can ije detennlned as described In "Methods of Enzymatic 
Analysis", third edition. 1984, Verlag Chemle. Welnhelm. vol. 4. 

Parent Transferases 

Parent transferases (i.e. enzymes classified under the Enzyme Classification number E.C. 2 in 
25 acconJance with the Recommendations (1992) of the Intemational Union of Biochemistry and 
Molecular Biology (lUBMB)) include transferases within this group. 

The parent transferases may be any transferase in the subgroups of transferases: transferases 
transfemng one-carbon groups (E.C. 2.1); transferases transfening aldehyde or residues (E.C 
2.2); acyltransferases (E.C. 2.3); glucosyltransferases (E.C. 2.4); transferases transfening 
30 alkyi or aryl groups, other that methyl groups (E.C. 2.5); transferases transfening nitrogeneous 
groups (2.6). 

In a prefenred embodiment the parent transferase Is a transglutaminase E.C 2.3.2.1 3(Pre»teln- 
glutamine beta-glutamyltransferase). 

Transglutaminases are enzymes capable of catalyzing an acyl transfer reaction In which a 
35 gamma-carboxyamide group of a peptide-bound glutamlne residue Is the acyl donor. Primary 
amino groups in a variety of compounds may function as a<^l acceptors with the subsequent 
fonmation of monosubstltuted gamma-amides of peptide-bound glutamic add. When the 
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epsilon-amino group of a lysine residue in a peptide^hain serves as the acyl acceptor, ttie 
transferases form intraniotecular or intermoiecuiar gamma-giutamyl-epsilon-lysyi crosslinlcs. 
The parent transglutaminase may be of human, animal (e.g. bovine) or microbial origin. 
Examples of such parent transglutaminases are animal derived Transglutaminase, FXIIIa; 

5 microbial transglutaminases derived from Physarum polycephalum (Klein et ai.. Journal of 
Bacteriology, Vol. 174, p. 2599-2605); transglutaminases derived from Streptomyces sp.. 
Including Streptomyces lavendulae, Streptomyces lydicus (fomier Streptomyces llbani) and 
Streptoverticillium sp.. including StreptoverticilHum mobaraense. Streptoverticillium cin- 
namoneum, and Streptoverticillium griseocameum (Motokl et ai., US 5,156,956; Andou et al., 

10 US 5,252.469; Kaempfer et ai., Journal of General Microbiology, Vol. 137, p. 1831-1892; OchI 
et ai.. International Journal of Sytematic Bacteriology. Vol. 44, p. 285-292; Andou et ai., US 
5,252,469; Williams et al., Journal of General Microbiology. Vol. 129, p. 1743-1813). 
It is to be understood that also transferase variants are contemplated as the parent enzyme. 
The activity of transglutaminases can be determined as described in "Methods of Enzymatic 

15 Analysis", third edition, 1984, Veriag Chemie, Weinheim, vol. 1-10. 

Parent Phytases 

Parent phytases are included in the group of enzymes classified under the Enzyme Classifica- 
tion number E.C. 3.1.3 (Phosphoric Monoester Hydrolases) in accordance with the 
20 Recommendations (1992) of the I ntemational U nion of Biochemistry and Molecular Biology 
(lUBMB)). 

Phytases are enzymes produced by microorganisms, which catalyse the conversion of phytate 
to inositol and inorganic phosphorus. 

Phytase producing microorganisms comprise bacteria such as Bacillus subtilis. Bacillus natto 
25 and Pseudomonas; yeasts such as Saccharomyces cerevlslae; and fungi such as Aspergillus 
niger, Aspergillus ficuum. Aspergillus awamori. Aspergillus oryzae, Aspergillus tenreus or 
Aspergillus nidulans. and various other Aspergillus species). 

Examples of parent phytases include phytases selected from those classified under the 
Enzyme Classification (E.C.) numbers: 3-phytase (3.1.3.8) and 6-phytase (3.1.3.26). 
30 The activity of phytases can be detemnined as described in "Methods of Enzymatic Analysis", 
third edition, 1984. Veriag Chemie, Weinheim, vol. 1-10, or may be measured according to the 
method described in EP-A1-0 420 358, Example 2 A. 

Lyases 

35 Suitable lyases include Polysaccharide lyases: Pectate lyases (4.2.2.2) and pecQn lyases 
(4.2.2.10), such as those from Bacillus licheniformis disclosed in WO 99/27083. 

Isomerases 
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Protein Disulfide isomerase. 

Without being limited thereto suitable protein disulfide Isomerases include PDIs described in 
WO 95/01425 (Novo Nordlsk A/S) and suitable glucose isomerases Include those described In 
Biotechnology Letter, Vol. 20. No 6. June 1998. pp. 553-56. 
5 Contemplated isomerases Include xylose/glucose Isomerase (5.3.1.5) Including Sweetzyme® 
(available from Novozymes A/S). 



AAaterlals and Methods 

Strains and olasmids 

1 0 E.coli DH1 2S (available from Gibco BRL) Is used for yeast piasmid rescue. 

pTMPP2ver2 Is a S. cerevisiae and E.coli shuttle vector under the control of TPI 
promoter, constructed from pJC039 described in WO 00/10038. It Is used for llbraiy 
construction, yeast expression, screening and sequencing. 

Saccharomyces cerevisiae YNG318: MATa Dpep4[clr+] ura3-52, Ieu2-D2, his 4-539 is 
15 used for the construction of yeast library and the expression of the fUslon protein. It Is 
described in J. Biol. Chem. 272 (15). 9720-9727. 1997). 



Media and substrates 
10X Basal solution 

20 66.8 g/L Yeast nitrogen base with out amino adds (DIFCO) 

100 g/L succinate 
60 g/L NaOH 



SC-glucose 

25 1 00 mL/L 20% glucose (i.e., a final concentration of 2% = 2 g/1 00ml)) 

4 mL/L 5% threonine 

1 0 mL/L 1 % tryptophan 

25 mL/L 20% casamino adds 

1 00 mL/L 1 0 X basal solution 

30 The above solution is sterilized using a filter of a pore size of 0.20 micro meters. Agar 

and H20 (approx. 761ml) Is autodaved together, and the separately sterilized SC-glucose 

solution is added to the agar solution. 



YPD 

35 20 g/L Bacto pepton 

1 0 g/L yeast extract 

1 00 mL/L 20% glucose (sterilized separately) 
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Na-Dhvtate Plate 

100 mUL 1 M Na acetate buffer (pH 5.5) 

5 g/L Na phytate 

5 30 gA. agar 

PEG/LIAc solution 

SOmL 40% PEG4000 (sterilized by autoclaving) 

1 mL 5M Lithium Acetate (sterilized by autoclaving) 

10 

Tra<?e M^tql SQimjop 
FeS04x7H20 13.90 g/L 

MnS04x5H20 13.60 g/L 

ZnCI2 6.80 g/L 

15 CuS04x5H20 2.50 g/L 

NiCI2x6H20 0.24 g/L 

Citric acid xH20 3.00 g/L 

pNPB 

20 p-nitrophenyi butyrate (SIGMA N-9876) 
Cutinase activltv (LU) 

A substrate for cutinase Is prepared by emulsifying tributyrin (glycerin tributyrate) using gum 
Arabic as emulsifler. Tine hydrolysis of tributyrin at SO^C at pH7 Is followed In a pH-stat titration 
25 experiment. One unit of cutinase activity (1LU) equals the amount of enzyme capable of 
releasing 1 micro mol butyric acid/min at the standard conditions. 

Phvtase activity assay 

10 micro L diluted enzyme samples (diluted In 0.1 M sodium acetate. 0.01 % Tween20, 

30 pH 5.5) were added into 250 micro L 5 miy/i sodium phytate (Sigma) In 0.1 M sodium acetate, 
0.01 % Tween20, pH 5.5 (pH adjusted after dissolving the sodium phytate; the substrate was 
preheated) and incubated for 30 minutes at 37"C. The reaction was stopped by adding 250 
micro L 10 % TCA and free phosphate was measured by adding 500 micro L 7.3 g FeS04 In 
ICQ mi molybdate reagent (2.5 g (NH4)6Mo7O24.4H20 In 8 ml H2S04 diluted to 250 mi). The 

35 absorbance at 750 nm was measured on 200 micro L samples In 96 well microtiter plates. 
Substrate and enzyme blanks were included. A phosphate standard curve was also included 
(0-2 mM phosphate). 1 PkT equals the amount of enzyme tiiat releases 1 micromole 
phosphate/min at the given conditions. 
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Examples 

Example 1: Construction of nucleic add sequence encoding fusion protein 
5 The cutlnase gene was amplified by PGR using the below primers AM34 (SEQ ID NO: 

1) and Cutl-R (SEQ ID NO: 2). The phytase gene together with the linker region was amplified 

by PGR using the primers Gutl-llnker-P (SEQ ID NO: 3) and AM35 (SEQ ID NO: 4). 

PGR Is carried out by the PTG-200 DNA Engine. DNA fragments are recovered from 

agarose gel by the QIagen gel extraction Kit. The resulted two fragments were Joined by SOE 
10 method (Splicing by Overlap Extension, see "PGR: A pracUcal approach", p. 207-209. Oxford 

University press, eds. McPherson, Quirice, Taylor). The PGR conditions are as follows: 



PGR reaction system: 


Gondltlons: 


38.9 micro L H20 


1 


98'>G 


10 sec 


5 micro L 1 0 X reaction buffer 


2 


68° G 


90 sec 


1 micro L Klen Taq LA (GLONTEGH) 


1-2 


30 cycles 




4 micro L 10 mM dNTPs 


3 


68«>G 


lOmIn 


0.3micro L X 2 1 00 pmole/mlcro L Primers 








0.5 micro L Template DNA 









The resulting fragments were gel-purifled and used for the template for the second 
15 PGR reaction. 



PGR reaction system: 


Gondltlons: 


38.4 micro L H20 


1 


98« 


G 


5 micro L 1 0 X reaction buffer 




10 sec 




1 micro L Wen Taq (GLONTEGH) 


2 


66«» 


G 


4 micro L 10 ml\/i dNTPs 




90 sec 




0.3micro L X 2 1 00 pmole/micro L Primers 


1-2 


30 (^cles 




O.Smicro L X 2 P GR fragments 


3 




G 






lOmIn 





Example 2: Transfonmation and expression of the fusion orot ein In S. cerevisiae 
20 The S. cerevisiae transformants were obtained by the following procedure: 

1 . Mix O.Smicro L of vector (Xba I digested) and 1 micro L of PGR fragments obtained 
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In Example 1. 

2. Thaw YNG318 competent cells on Ice and use as host cell. 

3. Mix lOOmicro L of the cells, the DMA mixture from step 1 above and lOmicro L of 
canier DNA (Clontech) In 12ml polypropylene tubes (Falcon 2059). 

5 4. Add O.emI PEG/LIAc solution and mix gently. 

5. Incubate for 30mln at 30^C, and 200 rpm. 

6. Incubate for 30 min at 42^C (heat shock). 

7. Transfer to an eppendorf tube and centrifuge for 5 sec. 

8. Remove the supematant and resolve sediment in 3mi of YPD. 
0 9. Incubate the cell suspension for 45 min at 200 rpm at 30^C. 

10. Pour ttie suspension to SC-glucose plates and Incubate at 30^C for 3days. 
The obtained transfonnants were cultivated in YPD medium in 24 well plates at 25^C 
for 3 days at 1 SOrpm. 

The plates were centrifuged and the supematant was assayed for cutinase activity and 
15 phytase activity. 



Transformant No 


Cutinase activity 
(LU/ml) 


Phytase activity 
(FYT/ml) 


Ratio 
(FYT/LU) 


1 


4.36 


7.00 


1.6 


2 


2.59 


4.65 


1.8 


3 


4.29 


6.25 


1.5 


4 


0.68 


1.65 


2.4 


5 


2.87 


5.00 


1.7 
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The transfonnants showed both cutinase and phytase activity meaning that the fusion 
protein was secreted and folded properly as an active form. Further the table shows that the 
activity ratio Is at a constant level and that the two enzymes consequently must be co- 
expressed as a fused enzyme. 



Example 3: Comparison of relative specific activitv using ohvtase variants 

Two Idnds of fusion protein using two phytase variants genes in combination with one 

cutinase were constructed by the method as described in example 1. The two phytase variants 
25 are denoted Variant N and Variant X. 

The specific activity ratio of Variant N to Variant X is 100:180. The cutinase and 

phytase activities of several transfonnants were measured. 



Cutinase 


Phytase variant 


Cutinase 


activity 


Phytase 


activity 


Ratio (FYT/LU) 


variant 




(LU/ml) 




(FYT/ml) 
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X/Ariflnt A 


Variant N 

vol lai li 1^ 


1 65 


0 68 


0,41 


\/orlonf A 


Variant N 
vol lai It 1^ 


4 65 


2 59 


0.56 


\/orifint A 

vcniani M 


\/Arianf N 
vai lai li 1^ 


fi 25 

w«*»w 


4 29 

^•A»W 


0.69 


Variani M 


X/ariant M 
vai lai II 1^ 


7 00 

f *ww 


4 36 


0.62 


Vcinanl M 


Variant N 

VallCilH v% 


5 00 

W»WW 


2 87 

A*W f 


0.57 


V/orlant A 
Variani M 


Variant N 


8 00 

w«ww 


5 32 

W> Wfc 


0.67 


Vanani M 


\/ariant V 
vai icini /\ 


10 00 


11 15 

1 1 • 1 w 


1.12 


X/orlanf A 


Variant X 
vai lai 11 /\ 


7 30 

r aWW 


9 31 


1.28 


Vanani r\ 


Vol idni /x 


1 W* WW 


17 64 

• f •W" 


1.36 


Variant A 


Variant X 


9.00 


10.91 


1.21 


Variant A 


Variant X 


3.75 


3.70 


0.99 


Variant A 


Variant X 


2.50 


2.93 


1.17 


Variant A 


Variant X 


8.75 


11.88 


1.36 



Data shows that the activity-ratio is at a constant level for each of the two variant 
combinations and that the two enzymes consequently must be co-expressed as a fused 
enzyme. Further the ratio between FYT/LU for the two fused enzymes systems Is dose to the 
5 expected value of 1.8, and it can consequently be concluded that a change In specific activity 
can be monitored by measuring the activity ratio of the fused enzyme. 



Examole 4: Other linker (FLAG) 

The fusion protein with another linker Fi^G (DYKDDDK) was constructed using ttie same 
10 method as described in the example 1. The following primers were used for SOE: AM34 (SEQ 
ID NO: 1), Cuti-R (SEQ ID NO: 2), fiMiS (SEQ ID NO:4). and Cuti-FLAG-P (SEQ ID NO:5). 

The transformant was cultured and assayed for cutlnase and phytase activities as 
described In example 2. 

15 



Linker 


Cutinase activity 

(LU/ml) 


Phytase activity 
(FYT/ml) 


Ratio 
(FYT/LU) 


FLAG 


3.63 


5.18 


1.43 


PEPTPEPT 


4.31 


6.22 


1.44 



The ratio between phytese activity and cutinase activity Is constent, and at the same 
level as in example 2. it can consequentiy be concluded that the two enzymes are co- 
expressed as a fused enzyme independent of choice of linker. 

20 
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Example S: Hloh through out screening 

Relative activities of cutinase and phytase activities were measured in the same well of 
96-well micro titer plates using the following method. In this example the two proteins as 
described in the example 3 were used (variant A-t- variant X. variant A+ variant N). 
ly/lethod: 

1. Add 2.5 micro L of samples at several concentrations of the fusion protein in a 96-weti micro 
plate. 

2. Add 100 micro L of substrate solution. 

Substrate solution: 

A. 1 mi of 3 mg/ml pNPB dissolved In 2-propanol. 

B. 10 ml of 1 mg/ml Na-phytate solution dissolved in 0AM acetate buffer (pH 5.75) 
ly/lix A and B Just before experiment. 

3. Incubate at room temperature for 10 minutes. 

4. Measure A405 (Cutinase activity) 

5. Add 100 micro L of stop solution (7.3 g FeS04 In 100 ml molybdate reagent (2.5 g 
(NH4)6Mo7O24.4H20 In 8 ml H2S04 diluted to 250 mi) and keep the plate still for 10 minutes. 

6. Measure A750 (Phytase activity) and calculate the ratio (A750/A405). 



Well 
No. 


Cutinase 
variant 


Phytase 
variant 


A405 


A750 


Ratio (A750/A405) 


1 


Variant A 


Variant N 


0.155 


0.108 


0.70 


2 


Variant A 


Variant N 


0.243 


0.162 


0.67 


3 


Variant A 


Variant N 


0.295 


0.201 


0.68 


4 


Variant A 


Variant N 


0.385 


0.299 


0.78 


5 


Variant A 


Variant X 


0.175 


0.193 


1.10 


6 


Variant A 


Variant X 


0.251 


0.252 


1.00 


7 


Variant A 


Variant X 


0.319 


0.298 


0.93 


8 


Variant A 


Variant X 


0.387 


0.397 


1.03 



The table shows that the relative each enzyme activity is possible to be measured in one well 
of 96-well micro titre plates and that variants with Improved specific activity can be screened. 
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